Why Does Collaborative Filtering Work? — Recommendation Model Validation and Selection by Analyzing Bipartite Random Graphs

نویسندگان

  • Zan Huang
  • Daniel D. Zeng
چکیده

A large number of collaborative filtering (CF) algorithms have been proposed in the literature as the core of automated recommender systems. However, the underlying justification for these algorithms is lacking and their relative performances are typically domainand data-dependent. In this paper, we aim to develop initial understanding of the validation and model/algorithm selection issues based on the graph topological modeling methodology. By representing the input data in the form of consumer-product interactions such as purchases and ratings as a bipartite graph, we develop bipartite graph topological measures to capture patterns that exist in the input data relevant to recommendation. Using a simulation approach, we observe the deviations of these topological measures for given recommendation datasets from the expected values for simulated random datasets. These deviations help explain why certain CF algorithms work for the given datasets. They can also serve as the basis for a comprehensive model selection framework that chooses appropriate CF algorithms given the characteristics of the dataset under study. We validate our approach using two real-world e-commerce datasets.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analyzing Consumer-Product Graphs: Empirical Findings and Applications in Recommender Systems

W apply random graph modeling methodology to analyze bipartite consumer-product graphs that represent sales transactions to better understand consumer purchase behavior in e-commerce settings. Based on two real-world e-commerce data sets, we found that such graphs demonstrate topological features that deviate significantly from theoretical predictions based on standard random graph models. In p...

متن کامل

A generalized model via random walks for information filtering

There could exist a simple general mechanism lurking beneath collaborative filtering and interdisciplinary physics approaches which have been successfully applied to online E-commerce platforms. Motivated by this idea, we propose a generalized model employing the dynamics of the random walk in the bipartite networks. Taking into account the degree information, the proposed generalized model cou...

متن کامل

A New Similarity Measure Based on Item Proximity and Closeness for Collaborative Filtering Recommendation

Recommender systems utilize information retrieval and machine learning techniques for filtering information and can predict whether a user would like an unseen item. User similarity measurement plays an important role in collaborative filtering based recommender systems. In order to improve accuracy of traditional user based collaborative filtering techniques under new user cold-start problem a...

متن کامل

A New Perspective on Recommender Systems A Random Graph Theory Approach

Random graph theory has become a major modeling tool to study complex systems. We apply random graph theory to analyze bipartite consumer-product graphs that represent sales transaction data to understand purchase behavior in e-commerce settings. Using two real-world e-commerce datasets we found that such graphs demonstrate topological features that deviate from theoretical predictions based on...

متن کامل

QoS-based Web Service Recommendation using Popular-dependent Collaborative Filtering

Since, most of the organizations present their services electronically, the number of functionally-equivalent web services is increasing as well as the number of users that employ those web services. Consequently, plenty of information is generated by the users and the web services that lead to the users be in trouble in finding their appropriate web services. Therefore, it is required to provi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005